Minimal Cost Attribute Reduction through Backtracking

نویسندگان

  • Fan Min
  • William Zhu
چکیده

Test costs and misclassification costs are two most important types in cost-sensitive learning. In decision systems with both costs, there is a tradeoff between them while building a classifier. Generally, with more attributes selected and more information available, the test cost increases, and the misclassification cost decreases. We shall deliberately select an attribute subset such that the total cost is minimal. Existing decision tree approaches deal with this issue from a local perspective. They benefit from immediately available test results, therefore objects falling into different branches may experience different tests. In this paper, we consider the situation where tests have delayed results. Since we need to choose a test set for all objects, the attribute reduction problem is defined from a global perspective. We propose a backtrack algorithm with three pruning techniques to find a minimal cost reduct. Experimental results indicate that the pruning techniques are effective, and the algorithm is efficient on a medium sized dataset Mushroom.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rules-based Classification with Limited Cost

In test cost-sensitive decision systems, it is difficulty for us to find an optimal attribute set and construct a quality classifier with limited cost. The minimal test cost-sensitive attribute reduction is proposed to address the former problem. However, it is inevitable to remove some good even better attributes in the minimal test cost-sensitive attribute reduction. As a result, the classifi...

متن کامل

Minimal cost feature selection of data with normal distribution measurement errors

Minimal cost feature selection is devoted to obtain a trade-off between test costs and misclassification costs. This issue has been addressed recently on nominal data. In this paper, we consider numerical data with measurement errors and study minimal cost feature selection in this model. First, we build a data model with normal distribution measurement errors. Second, the neighborhood of each ...

متن کامل

Constraint Sovling Engine based Nurse rostering with Intelligent Backtracking

Efficient utilization of time and effort is essential in Personnel scheduling problems to evenly balance the workload among the people and attempt to satisfy the personnel preferences. In Constraint Satisfaction Problem based scheduling problems, when a branch of the search fails the backtracking search algorithm back up to the preceding variable and try a different value for it. So here the mo...

متن کامل

Analysis of alternative objective functions for attribute reduction in complete decision tables

Attribute reduction and reducts are important notions in rough set theory that can preserve discriminatory properties to the highest possible extent similar to the entire set of attributes. In this paper, the relationships among 13 types of alternative objective functions for attribute reduction are systematically analyzed in complete decision tables. For inconsistent and consistent decision ta...

متن کامل

Test-cost-sensitive attribute reduction

In many data mining and machine learning applications, there are two objectives in the task of classification; one is decreasing the test cost, the other is improving the classification accuracy. Most existing research work focuses on the latter, with attribute reduction serving as an optional pre-processing stage to remove redundant attributes. In this paper, we point out that when tests must ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011